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ABSTRACT 



Methods and apparatus for hosting a network service on a 
clu^er of servers, each including a primary and a secondary 
Internet Protocol (IP) address. A common cluster address is 
assigned as the secondary address to each of the servers in 
the cluster. The cluster address may be assigned in UNIX- 
based servers using an ifconfig alias option, and may be a 
ghost IP address that is not used as a primary address by any 
server in the duster. Client requests directed to the cluster 
address arc dispatched such that only^ie^f the servers of 
the cluster respond s to a giv en client request The dispatch- 
ing may use a routing-based technique, in which all client 
requests directed to the cluster address are routed to a 
dispatcher connected to the local network of the server 
cluster. The dispatcher then applies a hash function to the 
client IP address in order to select one of the servers to 
process the request. The dispatching m ay alte majdyely use a 
broadcastji^based technique, in which a router broadcasts 
client requests having the cluster address to all of the servers 
of the cluster over a local network. The servers then each 
provide a filtering routine, which may involve comparing a 
server identifier with a hash value generated from a cUent 
address, in order to ensure that only one server responds to 
each request broadcast by the router. 

39 Clahns, 3 Drawing Sheets 
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HOSTING A NETWORK SERVICE ON A to map the server domain name to an IP address. The TCP 

CLUSTER OF SERVERS USING A SINGLE- and IP packet routing functions in ciicnt 12 and server 14 are 

ADDRESS IMAGE provided in respective TCP layers 24-1, 24-2 and IP layers 

26-1, 26-2. The TCP and IP layers are generally associated 
5 with the tran^rt and network layers, respectively, of the 
FIELD OF THE INVENTION well-known Open Systems Interconoection (OSI) model. 
... The TCP layers 24-1, 24-2 process TCP padcets of the client 
The present invention relates generally to data commu- request and server response. The TCP packets each inchide 
nication nctworics such as the Interact and more particularly ^ TCP header identifying a port number of the TCP con- 
to techniques for hosting network services on a cluster of section between the client 12 and server 14. The IP layers 
servers used to deUver data over a network in response to 26-1, 26-2 process IP packets formed &om the TCP padcets 
client requests, where the duster of servers can be coUec- t^e TCP layers. The IP packets each include an IP header 
tively identified by a client using a single-address image. identifying an IP address of the TCP/IP connection between 

BACKGROUND OF THE INVENTION ^ ^7^' 

15 The IP address for a given network service may be 

Wth the explosive growth of the World Wide Web, many determined, as noted above, by the client accessing a ooo- 

popular Internet web sites arc heavily loaded with clirat ventiooal DNS. The IP layer 26-1 of tiie client 12 uses the 

requests. For example, it has been reported in S. L. resulting IP address as a destination address in the IP packet 

GarfinkeU "The Wizard of Netscape," Webserver Magazine, headers of client request padcets. The IP address together 

July/August 1996, pp. 59-63, that home pages of Netscape with the TCP port number provide the complete tran^Jort 

Communications receive more than 80 million client address for the HTTP server process. The client 12 and 

requests or "hits" per day. A single server hosting a service server 14 also indude data link and physical layers 28-1 for 

is usually not suf&cient to handle this type of aggressive performing framing and other operations to configure client 

growth. As a result, clients may experience slow response request or reply packets for transm^on over the networks 

times and may be unable to access certain web sites. ^5 16 and 20. The router 18 includes data link aiKl physical 

Upgrading the servers to more powerfiil machines may not layers 28-3 for converting client request and server reply 

always be cost-eflfective. Another common approach packets to IP format, and an IP layer 26^ for performing 

involves deploying a set of machines, also known as a padcet routing based on IP addresses. The server 14 

cluster, and configuring the machines to wodc together to responds to a given client request by supplying the requested 

hoe^ a single service. Such a server cluster should preferably 3Q information over the established TCP/IP coimection in a 

publicize only one server name for the entire cluster so that number of reply padcets. The TCP/IP connection is then 

any configuration change inside the cluster does not affect closed. 

cUent applications. The World Wide Web and other portions jhere are many known techniques ft)r distributing HTTP 

of the Internet utilize an application-level protocol, known client requests to a cluster of servers. FIGS. 2 and 3 ilhistratc 

as the Hypertext Transfer Protocol (HTTPX which is based 35 server-side single-IP-address image approaches which 

on a clicntyserver architecture. The HTTP protocol is present a single IP address to the clients. An example of this 

desCTibcd in greater detail in "Hypertext Transfer Protocol— approach is the TCP router approach described in D. M. 

HTTP/1.0," Network Working Group, May 1996, <http:// Dias, W. Kish, R. Mukbeijee and R. Tewari, "A Scalable and 

www.ics.uci.edu/pub/ iet£^ttp>, which is incorporated by Highly Available Web Server," Proceedings of COMPCON 

reference herein. ^ '96, pp^5-92,1996, which is incorporated by reference 

FIG. 1 illustrates an exemplary dicnt/server ardiitecture hereirL FIG. 2 illustrates the TCP router approach in which 

suitable for implementing HTTP-based network services on a client 12 establishes a TCP/IP connection over Internet 30 

the Internet. A client 12 generates an HTTP request for a with a server-side router 32 having an IP address RA. The 

particular service, such as a request for information associ- router 32 is ooimected via a LAN 36 to a server cluster 34 

aled with a particular web site, and a Transmission Control 45 including N servers 14-i, i -1, 2, . . . N, having respective 

Protocol/Internet Protocol (TCP/IP) connection is then IP addresses SI, S2, . . . SN. Each server of the duster 34 

established between the client 12 and a server 14 hosting the generally provides access to the same set of contents, and the 

service. The client request is delivered to the server 14 in this contents may be replicated on a local disk of each server, 

example via a TCP/IP connection over a first network 16, a shared on a network file system, or served by a distributed 

router 18 and a second network 20. The first network 16 may 50 file system. 

be a wide area communication network such as the Internet, Xhe single-address image is achieved by publicizing the 
while the second network 20 may be an Ethernet or other address RAof the server-side router 32 to the clients via the 
type of local area network (LAN) interconnecting server 14 DNS. The client 12 therefore uses RA as a destination IP 
with other servers in a server cluster. The router 18, also address in its request The request is directed to the router 32, 
referred to as a gateway, performs a relaying function 55 which then dispatches the request to a selected server 14-k 
between the first and second networks which is transparent of server duster 34 based on load characteristics, as indi- 
to the client 12. cated by the dashed line connecting client 12 to server 14-k 
The ciicnt reqiiest is generated by a web browser or other via router 32. Hie router 32 performs this di^atching 
application-layer program operating in an api^cation layer function by changing the destination IP address of each 
22-1 of the client 12, and is responded to by a file transfer 60 incoming IP packet of a given client request from the router 
system or other program in an application layer 22-2 of the address RA to the address Sk of selected server 14-k. The 
server 14. The requested network service may be designated selected server 14-k responds to the client request by send- 
by a Uniform Resource Locator (URL) which inchides a ing reply packets over the esublisbed TCP/IP connection, as 
domain name identifying the server 14 or a corresponding indicated by the dashed line connecting server 14-k to client 
server cluster hosting the service. The appHcation-level 65 12. In order to make the TCP/IP connection appear seamless 
program of the chent 12 initiates the TCP/IP cormection by to the client 12, the selected server 14-k changes the source 
requesting a local or remote Domain Name Service (DNS) IP address in its reply packets from its address Sk to the 
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router address RA. The advantages of this approach are that serveis io a server cluster configured in accordance with the 

it does not increase the oumber of TCP connections, and it invention may be designated by a single chister address 

is totally transparent to tbc clients. However^ since the which is assigned as a secondary address to each server. All 

above-noted source IP address change is performed at the IP client requests for a web site or other network service 
layer in a given server, the kernel code of every server in the s associated with the chister address are sent to the server 

chister has to be modified to implement this mechanism. A cluster, and a di^atching mechanism is used to ensure that 

proposed hybrid of the DNS approach and the TCP router each client request is processed by only one server in the 

approach, in which a DNS server selects one of several cluster. The dispatching may be configured to operate with- 

ciusters of servers using a round-robin technique, sufifers out increasing the number of TCP/IP connections required 

from the same problem. for each cUent request. The invention evenly distributes the 

FIG. 3 illustrates a server-side single-address image client request load among the various servers of the cluster, 

approach known as network address translation, as masks the faihire of any server or servers of the chister by 

described in greater detail in E. Anderson, D. Patterson and distributing client requests to the remaining servers without 

E. Brewer, "The Magicrouter, an i^lication of Fast Packet bringing down the service, and permits additicmal servers to 

Interposing,'' SyrnjK^iuin on Operatii^ Systems Design and added to the cluster without bringing down the service. 

ImplemenUUon,Opi^96,<http^/www. ^ ^ weU-^uted for use in hostiijw, site services, the 

^^r.^riihtl'^^^ n"^; technic^es of the present inventioETmay also be used to 

Local Director, <hltp://www.cisco.comAvarp/pubUc/751/ 2 r f ^ 

todir/iodex. html>, which are incorporated by referent '^PP°^ ' ""^^^ apphcaUons. 

herein. As in the TCP router approach of FIG. 2, the cKcnt *° ^ exemplary embodunent of the mvcntoon, a network 
12 uses the router address RA as a destination IP address io 20 service is hosted by a server chister in which each server 

a client request, and the router 32 dispatches the request to includes a primary IP address and a secondary IP address. A 

a selected server 14-k by changing the destination IP address common duster address is assigned as the secondary IP 

of each incoming request packet from the router address RA address for each of the servers. The cluster address may be 

to the address Sk of selected server 14-k. However, in the an IP address which does not correspond to a primary IP 
network address translation approach, the source IP is address of any of the servers. In UNIX-based servers, the 

addresses in the reply packets fiom the selected server 14-k cluster address may be assigned as the secondary address for 

arc changed not by server 14-k as in FIG. 2, but arc instead * given server using an ifconfig alias option. If a given server 

changed by the router 32. The reply packet flow indicated by includes multiple network interface cards, the cluster 

a dashed line in FIG. 2 thus passes from server 14-k to client address may be assigned to one of the network interface 
12 via router 32. 30 cards u^ing a UNIX ifconfig command without the alias 

Compared to the TCP router approach of FIG. 2, network °P^i*>^ ^^^^ similar technique. A router is coupled to a 

address translation has the advantage of server transparency. nctworic of the server cluster and is also coupled via the 

That is, no specific changes to the kernel code of the servers Internet to a cUent. The router receives cUent requests from 

arc required to implement the technique. However, both the ^® Internet, and uses a di^atching technique to direct client 
TCP router and network address translation approaches 35 requests having the cluster address as a destination. The 

require that the destination address in a request packet client requests are di^atched such that each of the requests 

header be changed to a server address so that the server can processed by only one of the servers in the cluster. The 

accept the request These approaches also require that the dispatching function may be based on the result of applying 

source address in a reply packet header be changed to the * function to an IP address of tbc given client. A 
router address so that the client can accept the reply These « suitable hash function may be determined using an analysis 

changes introduce additional processing overhead and * distribution of dicnt IP addresses in an access log 

unduly compUcate the packet delivery process. In addition, associated with one or more of the servers. In the event that 

because of the address changes, the above^lescribed single- * server has failed, the hash fiinction may be reappUed to the 

address image approaches may not be aiitable for use with ^ address to identify another server, 
protocols that utilize IP addresses within an application, *5 Two illustrative dispatching techniques for provkiing a 

such as that described in K. Egevang and P. Francis, "The IP single-address image for a server cluster in accordance with 

Network Address Translator," Network Working Group, invention include routing-based dispatching and 

RFC 1631, <http:/Avww.safety.net/rfcl631.txt>, which is broadcast-based di^atching. In the routing-based technique, 

incorporated by reference herein. Furthermore, in both the * dispatcher is coupled to the router and to a local network 
TCP router and network address translatioo approaches, the 50 of the server cluster. The router directs client requests having 

router 32 needs to store an IP address mapping for every IP cluster address to the dispatcher, and the di^atcher 

connectjoa Upon receiving an incoming packet associated selects a particular one of the servers to process a given 

with an existing TCP connection, the router has to search ^^"^ request based on the resuh of applying a hash function 

through all of the mappings to determine which server the ^ ^ ^^^^^ address. In the broadcast-based technique, the 
packet should be forwarded to. The router itself may there- 55 router broadcasts client requests having the cluster address 

fore become a bottleneck under heavy load conditions, ^ each of the servers over the local network of the server 

necessitating the use of a more complex hardware design, as cluster. Each of the servers implements a filtering routine to 

in the above-cited Cisco Local Direaor. ensure that each client request is processed by only one of 

It is therefore apparent that a need exists for improved mt&nsg routine may involve applying a 

techniques for hosting a network service on a cluster of hash fiinction to the client IP address associated with a given 

servers while presenting a single-address image to the cUent request, and comparing the result to a server identifier 

cUents, without the problems associated with the above- ^ determine whether that server shouM process the cUent 

described conventional approaches. request 

^ The techniques of the present invention provide fast 

SUMMARY OF THE INVENTION dispatehingand can be implemenied wilh n«Jed cos. and 

The present invention provides methods and apparatus for complexity. The techniques are suitable for use in TCP/IP 

hosting a network service on a cluster of servers. All of the networks as well as networks based on a variety of other 
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standards and protocols. Unlike the conventional single- secondary address is also referred to herein as a cluster 

address image approaches, the present invention does not address, and may be established using an ifcoofig alias 

require that a destination address in a request packet header option available on most UNIX-based systems, or similar 

be diang^ to a server address so that the server can accq)t techniques available on other systems^ The chister address 

the request, or that a source address in a reply packet header 5 °^*y publicized to clients using the above-noted Domain 

be changed to the router address so that the client can accept Name Service (DNS) which translates domain names asso- 

the reply. In addition, the router need not store an IP address Uniform Resource Locators (URLs) to IP 

mapping for every IP connection, nor is it required to search addresses. All client requests to be directed to a service 

through such a mapping to determine which server a packet hosted by the server cluster are sent to the single cluster 

should l>e forwarded to. The router itself win therefore not lo address, and dispatched to a selected one of the servers using 

become a bottleneck under heavy load conditions, and routing-based or broadcast-based dispatching techniques to 

special router hardware designs are not required. These and described in greater detail below. Once a server is 

other features and advantages of the present invention will selected, future request packets associated with the same 

become more apparent from the accompanying drawings client request may be directed to the same server. All other 

and the follorwing detailed description. 15 commimications within the server cluster may utilize pri- 
mary IP addresses of the servers. 

BRIEF DESCRIPTION OF THE DRAWINGS The above-noted ifconfig alias option is typically used to 

FIG. 1 is a block diagram iUustrating a convenUonal ^iOffln y.rvfir trr fffifve more th.n or^ ^om . in mrne 

rer interconnection in .«T,,d,n^ .h. iY^/ii> For example, the if^nfig alia^ Qp^qq aUgwg a ^plel^" 



client-server interconnection in accordance with the TCP/IP „ u t7- 1 m , — ; — . 

standard* ^^'^fir ^ t o attach mulUple IP addresses, and thus multiple dom ain 

Fir ^ ;il„ctror<.c o ^ TYTJ . . u • ^ uamcs. to a single network intfrtanr. a^: Hr^L-rf ,n ^-v^^ 

HG. 2 ilhistrates a pnor art TCP router techmque for Sci^^, One Interface" <ht^://www.thcsphere.oom/-^lp/ 
hosting a network service on a chister of servers; TwoServers/>. which is incorporated by reference herein. 

FIG. 3 illustrates a prior art network address translation Clie nt requests directed to any of the multiple domain nam es 
technique for hosting a network sendee on a cluster of 25 c an Uien be serviced by the same server . The server deter- 
servers; mines which d omain name a given requSt is associated w ith 

FIG. 4 illustrates a technique for hostiiig a network by examining the des tmation address in the request packet, 
service on a cluster of servers using routing-based dispatch- pre sent invention utilizes t he ifconfig alias option to 

ing in accordance with an exemplary embodiment of the aUow two server^ to ^harc the, ^mr^g p address. Normally, 
invention; and 30 ^^o servers cannot share the same IP address because such 

FIG. 5 iUustrates a technique for hostir^ a neiworic *° arrangement would cause any packet destined for the 
service on a cluster of servers using broadcast-based dis- shared addre5gt tn he flrfy.ptr.H and responded to by both 
patching in accordance with another exemplary embodiment servers, oonfoang the client and possibly leading to a 
of the invention. connect ion res et. Therefore, before a server is permitted to 

35 attach a new IP address tojtenetwnric infftrfanp.^ a rhf^^v n^ay 
DETAILED DESCRIPTION OF THE *^ ^ ensure that no other server on the same local area 

INVENTION network (LAN) is using that IP address. If a duplicate 

ad dress is founds b oth serygis are informe d and warnings are 

The present mvention will be illustrated below in con- issued. The routiiig-ba^^d or broadcast-based dis^hing of 
junction with exemplary client/ server connections estab- 40 thcpV^sent invention ensures that every packet is processed 
lisbed over the Internet to a server cluster using the Trans- by only one serverofihg_cl uste7rsuch that the above-no ted 
mission Control Protocol/Internet Protocol (TCP/IP) warnings do not create a problem 

standard. U shoidd be underetood, however, that the inven- alternative technique for assigning a secondary 

Uon IS not hmited to use with any particular type of network address to a given server of a server cSster in accordanci 
or network commurucaUon protocol The disclosed tech- 45 with the invenliSHTivolves configuring the given server to 

""^^^ ^ ? ^ '''^'y °^ °»^llipl« ^^'^ interface cank such that a different 
networks and protocols. The term "server chister^ as used address can be assigned to each of the network interface 
herem refers to a group or set of servers interconnected or cards. For example, in a UNK-based system, conventional 
otherwise configpued to h^ a network service. Hie terms ifconfig commands may be used, without the above- 
cluster address- and smgle-address image" refer generaUy 50 described alias option, to assign a primary IP address to one 
to an address associated with a group of servers configured of the networic interface cards and a secondary IP address to 
to support a networic service or services. A «ghosl IP another of the networic interface cards. The secondary IP 
address" is one type of cluster address in the form of an IP address is also assigned as a secondary IP address to the 
address which is not used as a prunary address for any server remaining servers in the cluster, and used as a cluster address 
of a given server cluster. The term "networic service" is 55 for directing client requests to the cluster 
mlended to mclude web sites, Internet sites and data deUvery j^^ exemplary embodiments of the present invention to 
""^^x 1 i"" r ^^^^i^^^fer mechanism be described befow utilize dispatching Eques in wMch 
accessible by a chent over a network THe term "client ^i^^j ^ased o^a hash value of tS cHrnHP 
request" refers to a commumcation from-a client which address. The hash value may be generated VapSS a 
mitutes the network service. A given client request mav ^ *^.u i- « m ^i^^^ ujr appijiu^ a 

• t J , , ^ i_ . , ^ uidy 60 hash function to the client IP address, or by applymg another 

on^Sf oT^tf ^ — v,lu/£.^L'cUeot IP 

-ru! address. For example, given N servers and a padcet from a 

The present invention provkies an improved single- client having a client address CA, a dispatching function in 

address image approach to disuibuiing client requests to accordance with the invention may compute a hash value k 

servers of a server chister. In a preferred embodiment, the 65 as CA mod (N-1) and select server k to process the packet 

mvention aUows all servers of a server chister to share a This ensures that all request or reply packets of the same 

smgle common IP address as a secondary address. The TCP/IP connectmn are directed to the same server in the 
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server duster. A suitable hash fuoctioo may be determined 4, the di^atcber 64 applies a hash fimctioa to the IP address 

by analyzing a distribution of client IP addresses in actual of client 52 and determines that the conesponding request 

access logs associated with the servers such that client packet should be routed to server 54-2 at IP address S2. Tbc 

requests are approximately evenly distributed to all servers. dispatcher 64 then routes the request packet to the server 

When a server in the cluster f^ils, the subset of clients ^ 54-2 over LAN 56, as indicated by the dashed line» using the 

assigned to that server will not be able to connect to it. The primary address S2 of server 54-2 to distinguish it from the 

present invention addresses this potential problem by other servers of cluster 54. After the network interface of 

dynamically modifying the dispatching function upon detcc- server 54-2 accepts the packet, aU higher level processing 

tioD of a server failure. If the bash vahie of a given client IP may be based on the ghost IP address because that is the 

address maps to the failed server, tbe chent IP address is destination address in the packet IP header and possibly in 

rehashed to map to a non-failed server, and the connections the application-layer padcet contents. After processing the 

of the remaining clients are not affected by the faihire. request, the server 54-2 replies directly to the <dient 52 via 

FIG. 4 illustrates a routing-based di^atching technique in router 62 over the established TCP/IP coimection, using the 

accordance with the present invention. Solid lines indicate ghost IP address, and without passii^ through th& dispatcher 

network connections, while dashed hoes show tbe path of an 15 

exemplary client request and the corresponding reply. A It should be noted that when a request packet destined for 
client 52 sends a client request to a server cluster 54 the ghost IP address is received by the network interface of 
inchiding N servers 54-i, i«l, 2, . . . N having IP addresses dispatcher 64 and placed back onto the same network 
SI, S2, . . . SN and interooimectcd by an Ethernet or other interface for delivery to one of the servers 54-i over LAN 56, 
type of LAN 56. The client request is formulated in acoor- 20 cause an Internet control message protocol (I CMP) 
dance with the above-described HTTP protocol, and may host redirect message to be sent to the router 62. This ICMP 
include a URL with a domain name associated with a web message is designed to direct the router- 62 to i^ate its 
site or other network service hosted by the server cluster 54. routing table such that any future packets having the ghost 
The client accesses a DNS to determine an IP address for the IP address can bypass the dispatcher 64 and go directly to the 
domain oame of the service, and then uses the IP address to 2s destination server, as described in greater detail in W. R. 
establish a TCP/IP ooimection for communicating with one Stevens, TCP/IP Illustrated, Vol. 1, Ch. 6, pp. 6^-83, which 
of the servers 54-i of the server cluster 54. In accordance is incorporated by reference herein. However, this effect is 
with the invention, a "ghost" IP address is {Niblidzed to the undesirable in tbe routing technique of FIG. 4 because the 
DNS as a duster address for the server cluster 54. The ghost dispatcher 64 performs the server . selection process as pre- 
IP address is selected sudi that none of the servers 54-i of 30 viously described. It therefore may be necessary to suppress 
cltister 54 has that IP address as its primary address. the ICMP host redirect message for the ghost IP address by, 
Therefore, any request packets directed to the ghost IP for example, removing or altering the corresponding opcr- 
address are associated with chent requests for the service of ating system code in the dispatcher. In the above-mentioned 
the single-address image cluster 54. The use of the gbost [P alternative embodiments in which the di^atchirig function 
address thus distinguishes a network service hosted by tbe 35 is implemented wifhin the router 62, the ICMP redirect 
cluster from activities of the servers 54-i which utilize the message is not generated and therefore need not be sup- 
primary server addresses, and prevents interference with pressed. Another potential problem may arise when a reply 
these primary address activities. packet is sent back to the client 52 from the selected server 
The client 52 uses tbe ghost IP address as a cluster address 54-2 with the ghost IP address, in that it may cause the router 
for directing its request to tbe server chisler 54. The request 40 62 to associate, in its Address Resolution Protocol (ARP) 
is directed over Internet 60 to a router 62 having an IP cache, the ghost IP address with the LAN address of the 
address RA. The router 62 includes a routing table having an selected server. The operation of the ARP cache is described 
entry or record directing any incoming request packets in greater detail in W. R. Stevens, TCPAP Illustrated, M>1. 1, 
having the ghost IP address to a dispatcher 64 connected to Chs. 4 and 5, pp. which is incorporated by reference 
the LAN 56. The dispatcher 64 includes an opera tiijg system 45 herein. The illustrative embodiment of FIG. 4 avoids this 
configured to run in a router mode, using a routing algorithm problem by automatically routing the request packets to the 
which performs the dispatching described herein. In alter- dispatcher 64, and then dispatching based on the server 
native embodiments, the functions of the di^atcher 64 primary IP address, such that tbe router ARP cache is not 
could be incorporated into the router 62 in order to provide used. 

additional efficiency improvements. Each of tbe servers 54-i 50 FIG. 5 illustrates a broadcast-based dispatching technique 
of the chister 54 utilizes the above-described ifconfig alias in accordance with tbe present invention. Again, solid lines 
option to set the gbost IP address as their secondary address. indicate network connections, while dashed lines show the 
As noted above, this technique for setting a secondary path of an exemplary cUent request and the corresponding 
address for each of the servers 54-i generally does t»t reply. As in the FIG. 4 routing-based embodiment, client 52 
require any alteration of the kernel code running on the 55 sends a client request to server cluster 54 including N servers 
servers. In alternative embodiments, one or more of the 54-i, i -1, 2, . . . N connected to LAN 56 and having IP 
servers 54-i may be configured to include multiple network addresses SI, S2, . . . SN. The client 52 uses the above- 
interface cards, as previously noted, such that a different described ghost address as a cluster address for directing its 
^ address can. be .assigned to each of the network interface request to the server cluster.54. The request is directed over 
cards of a given server using a UNIX ifconfig command or 60 Internet 60 to a router 70 having an IP address RA The 
other similar technique. router 70 broadcasts any incoming request packets having 
The router 62 routes any packets having the ghost IP the ghost IP address to the LAN 56 interconnecting the 
address to the dispatcher 64 in accordarux with the above- servers 54-i of the server cluster 54, such that the request 
noted routing table record. Tbe dispatcher 64 then applies a packet is received by each of the servers 54-i. 
hash function to the client IP address in a given request 65 Each of the servers S4-i of the cluster 54 implements a 
packet to determine which of the servers 54-i the given filtering routine in order to ensure that only one of the 
packet should be routed to. In the example illustrated in FIG. servers 54-i processes a given client request. The filtering 
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routine may be added to a device driver of each of the ing is therefore oegligible. Although the additional routing 

servers 54-i. In an exemplary implementation, each of the step for every request packet sent to the ghost IP addrcss 

servers 54-i is assigned a unique identification (\U) number. may increase the traffic in the LAN of the server cluster, the 

The filterti^ routine of a given server 54-i computes a hash size of a request in many important applications is typically 

value of the client IP address and compares it to the ID 5 much smaller than that of the corre^ooding re^onse, which 

number of the given server. If the bash value and the ID is delivered directly to the client without the additioaal 

number do not match, the filtering routine of the given server routing. In the broadcast-based dispatching technique, the 

rejects the packet If the hash value and the ID number do broadcasting of each incoming request packet on the LAN of 

match, the given sender accepts and processes the packet as the server cliister does not substantially increase netwodc 

if it had received the packet through a conventional IP traffic. Although a ha^ value is computed for each incoming 

routing mechanism. In the illustrative example of FIG. 5, a packet having the ghost IP destination address, which 

packet associated with the request firom client 52 is broad- increases the CPU load of eadi server, this additional 

cast by the router 70 to each of the servers 54-i of the server computation overhead is negligible relative to the corre- 

chister 54 over the LAN 56 as previously noted. The filtering sponding communication delay. 

routine of server 54-2 generates a hash value of the client IP 15 Both the routing-based and broadcast-based dispatching 
aiklress which matches the unique ID number associated techniques of the present invention are scalable to support 
with server 14-2, and server 14-2 therefore accepts and relatively large numbers of servers. Although the dispatcher 
processes the packet. The filtering routines of the N-1 other in the routing-based technique could present a potential 
servers 54-i cadi indicate no match between the client IP bottleneck in certain applications, a study in the above-dted 
a(klress and the corresponding SCTver ID number, and there- 20 D. M. Dias et al. reference indicates that a single dispatcher 
fore discard the broadcast packet. The reply packets arc sent can support up to 75 server iKxles, which is sufficient support 
back to the cHcnt 52 via router 70, as indicated by the da^ed for many practical systems. Hie number of servers sup- 
lines, using the ghost IP address. ported may be even faigher with the present invention given 
The broadcast-based dispatching tediniquc of FIG. 5 may that the routing-based dispatching functions described 
be implemented using a pemianent ARP entry within the 2s herein are generally simpler than those in the D. M. Dias et 
router 70, to associate the ghost IP address with the Ethernet al. reference. It should also be noted that additional scal- 
er other local network broadcast address associated with ability can be (Stained by combining the routing-based 
LAN 56 of the cluster 54. A potential problem is that any dispatching of the present invention with a DNS round-robin 
reply packet from a selected server appears to be coming technique. For example, a DNS server may be used to map 
from the ghost IP address, and may therefore cause the 30 a domain name to one of a number of different ghost IP 
router 70 to overwrite the entry in its ARP cache such that addresses bek)nging to different server clusters using a 
the ghost IP address is associated with the LAN address of round-robin technique. In the broadcast-based di^atching 
the selected server. Hiis potential problem may be addressed technique, there is no potential di^atching bottleneck, 
by setting up a routing table entry in the router 70 to direct although the device drivers or other portions of the servers 
all packets having a ghost IP destination address to a second 35 may need to be modified to provide the above-described 
ghost IP address which is a legal subnet address in the LAN filtering routinesL 

56 of the server cluster 54 but is not used by any server. In The routing-based and broadcast-based dispatching of the 
addition, an entry is inserted in the ARP cache of the router present invention can also provide load balancing and f ailuire 
70 to associate the second ghost IP address with the broad- handling capabilities. For example, given N servers and a 
cast address of the LAN 56. When the router 70 routes a 40 packet from client address CA, the above-described routing- 
packet to the second ghost IP address, it will then acmally based dispatching function may compute a hash value k as 
broadcast the packet to each of the servers 54-i of the cluster CA mod (N-1) and select server k to process the packet. 
54. Since no reply packet is sent from the second ghost IP More sophisticated di^atchii^ functions can also be used, 
address, the corresponding entry of the router ARP cache and may involve analyzing the actual service access log to 
will remain unchanged. Another potential problem is that 45 provide more effective load balancing. In order to detect 
some operating systems, such as the NetBSD operating failures, each server may be monitored by a watdidog 
system, do not allow a TCP packet to be processed if it is daemon such as the watchd daemon described in greater 
received from a broadcast address. This potential problem detail in Y. Huang and C. Kintala, "Software Implemented 
may be avoided by a suitable modification to the broadcast Fauh Tolerance: Technologies and Experience," Proceed- 
address in the LAN packet header attached to the packet. 50 ings of the 23"^ International Symposium on Fault -Tolerant 
The routing-based and broadcast-based dispatching tech- Computing — FTCS, Toulouse, France, pp. 2-9, June 1993, 
niques described in conjunction with FIGS. 4 and 5 above which is incorporated by reference herein. When a server 
have been implemented 00 a cluster of Sun SPARC work- fails, the corresponding watchd daemon initiates a change of 
stations. The NetBSD operating system, as described in the dispatching fiinction to mask the faihire and rebalance 
NetBSD Project, <http:/Avww. NetBSD.org>, was used to 55 the load. A system call interface may be implemented to 
provide any needed kernel code modifications. The dis- allow the di^atching function to be changed while the 
patchiiig overhead associated with both techniques is mini- servers remain on-line. In routing-based dispatching, the 
— inal-becausc the-packet^dispatc±ing is based on simple IP — -watchd daemon may-notify the- dispatcher to change the 
address hashing, without the need for storing or searching dispatching function, while in broadcast-based dispatching, 
any address-mapping information. In the routing-based dis- 60 all servers may be notified to modify their filt^ing routines, 
patchir^ technique, the additional routing step in the dis- For example, if a server k fails, the new di^tching function 
patcher 64 typically adds a delay of about 1 to 2 msecs to the may check to see if the hash value CA mod N equals k. If 
TCP round-trq) time of each incoming request packet. A it does, a new hash value j>CA mod (N-1) is computed. If 
smdy in W. R. Stevens, TCP/IP Illustrated, Volume 3, pp. j is less than k, the packet goes to server j. Otherwise, the 
185-186, which is incorporated by reference herein, indi- 65 packet goes to server j+l. This technique does not affect the 
cates that the median TCP round-trip time is 187 msecs. The clients of non-failed servers, reassigns the clients of the 
additional delay attributable to the routing-based dispatch- failed server evenly to the remaining servers, and can be 
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readily extended to handle multiple server failures. Addi- aj;Rigning the common address for the at least one server 

tional servers cao be added to the cluster without bringing using an ifbonfig commaiid. 

down the service by changing the di^atching fuoction from 5. The method of claim 1 wherein the plurality of servers 

CA mod N to CA mod (N+1). are UNIX-based servers, and the assigning step includes 

In routing-based dispatching, the dispatcher may become 5 assigning the common addr^ utiliziijg 

a single point of failure, and therefore should also be ''T''^''' ""f^^' f SLSL^.? ^^^^ ^ 

monitored by a watchd daemon or other suitable failure - ^; fl^^T^^ ^'^^ ^^T^ 1?" 

. . ' . . ... TM 1- mcludes the ^ep of dispatchmg a request of a given chent to 

momtormgmechamsm. Upon detecnng a failure, the watchd of^rvers based on appUcSon of a hash 

daemon may trigger a transfer of the dispaldnng fiinctoon Unction to an IP address of the given dSenL 
from the primary di^atcher to a backup di^atcher, and then 10 7 j^c method of claim 6 wherein the hash function is 

direct the router to change the entry in its routing table such determined based on an analysis of a distribution of client IP 

that future incoming request packets are routed to the addressesinanaccesslogassociated withoneormoreof the 

backup dispatdier. Since no mapping table is maintained by servers. 

the primary dispatdier, this approach is substantially state- 8. The method of rlaim 6 wherein the di^atctung step 

less. Proper routing may be ensured by simply utilizing includes replying the hash function to the client IP address 

consistent routing fiuxrtions in the primary and backup to identify anoUier server if a server identified as a result of 

dispatchers, without the substantial additional costs associ- a previous application of the bash function has failed, 

ated with mapping-based approadies. 9. The me±od of claim 1 wherein the processing step 

The use of the ifoonfig alias option or other similar inchides the steps of: 
technique to provide a single-address image for a server ^ routing cUent requests directed to the common address to 

cluster provides a number of advantages over the oonven- a dispatcher connected to a local network associated 

tional techniques described previously. For example, it with the pturaUty of servers; and 

avoids the need to diange the destination address in a selecting a particular one of the servers to process a given 

request packet header so that a particular server can accept client request based 00 application of a hash function to 

the request, and the need to change the source address in a ^ a corresponding cUent address in the dispatcher, 

reply packet header to the cluster address so that the clirat 10. The method of claim 1 wherein the processing step 

can accept the reply. M^tb the singk-addrcss image approach incluctes the steps of: 

of the present invention, all servers can accept and respond broadcasting a given client request directed to the corn- 
to packets having the cluster address, so that the addresses mon address to each of the plurahty of servers over a 
in the request and reply packet headers do not need to be ^ local network associated with the servers; and 
modified. Since the single-image approach of the present implementing a filtering routine in each of the plurality of 
invention does not require alteration of the packet addresses, servers so that the given client request is prtxxssed by 
it is suitable for use with a wide variety of protocols, only one of the servers. 

inchiding those protocols which utilize IP addresses within The method of claim 10 wherein the implementing 

an aR)hcation program. In addition, the single-address step includes the steps of: 

image approach of the present invention does not require a applying a hash function to a client IP address associated 

router to store or to search through a potentially large ^^^^ request; and 

number of IP address mappings in order to determine which comparing the rcsuU of the applying step to an identifier 

cluster server should receive a request padcet. The mvention ^ particuUr server to determine whether that server 

thus efifective ly removes the possibihty that the router may ^^^^ ^ ^^^^^ ^ 

become a bottieneck under heavy load conditions. ^ ^ ^^^^^^^ ^^^^ ^ ^ ^^^^ 

The above-described embodiments of the invention are servers configured to support a network service over a 

intended to be illustrative only. Numerous alternative communication networic, each of the servers having a pri- 
embodimcnts may be devised by those skilled in the art ^5 mary address, the apparatus comprising: 

without departing from the scope of the foUowing claims. assigning a common address as a secondary 

What is claimed is: address for each of the plurality of servers such that 

1. A method of routing chent requests to a plurality of ^^ch of the servers individuaUy has the secondary 
servers configured to support a network service over a address* and 

communication netwoik, each of the serveis having a pri- for'procesang client requests directed to the com- 

mary address, the method composing the steps of: ^^^^ ^^^^ processed 

assigning a conunon address as a secondary address for by a particular one of the plurality of servers without 

each of the plurality of servers such that each of the modification of the common address in the request 

serveis individually has the secondary address; and 13. The apparatus of claim 12 wherein the network 
processing cdient requests directed to the common address 55 utilizes a TCP/IP protocol and the primary aixl secondary 

such that each of the requests is processed by a par- addresses are primary and secondary IP addresses, respec- 

ticular one of the plurahty of servers without modifi- lively. 

cation of the common address in the request. 14. The apparatus of claim 13 wherein the common 

2. The method of claim 1 wherein the network utilizes a address is an IP address which does not correspond to a 
TCP/IP protocol and the primary and secondary addresses 60 primary IP address of any of the plurality of servers. 

are primary and secondary IP addresses, respectively. 15. The apparatus of claim 12 wherein at least one of the 

3. The method of claim 2 wherein the common address is plurality of servers is a UNIX-based server including mul- 
an IP address wtiich does not corTe^>ond to a primary IP tiple network interface cards, and the assigning means is 
address of any of the plurahty of servers. operative to assign the common address for the at least one 

4. The method of claim 1 wherein at least one of the 65 server using an ifconfig command. 

plurahty of servers is a UNIX-based server including mul- 16. Tbe apparatus of claim 12 wherein the plurality of 

tiple network inter&ce cards, and the assigning step includes servers are UNIX-based servers, and the assigning means is 
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Operative to assign the common address utilizing an ifconfig 
alias option for at least a subset of the plurality of servers. 

17. The apparatus of claim 12 wherein the processing 
means is operative to di^atcfa a request of a given client to 
one of the plurality of servers based on application of a hash 
function to an IP address of the given client. 

18. The apparatus of claim 17 whoein the hash function 
is determined based on an analysts of a distribution of client 
IP addresses in an access log associated with one or more of 
the servers. 

19. The apparatus of claim 17 wherein the processing 
means is further operative to reapply the hash fimction to the 
client IP address to identify another server if a server 
identified as a result of a previous application of the hash 
function has failed. 

20. The apparatus of claim 12 wherein the processing 
means further includes a di^atcber connected to a local 
network asscxnated with the plurality of servers, wherein the 
dispatcher is operative to receive client requests directed to 
the common address, and to select a particular one of the 
servers to process a given client request based on appUcatioo 
of a hash function to a corre^)onding client address. 

21. The apparatus of claim 12 wherein the processing 
means further includes: 

means for broadcasting a given dieat request directed to 
the common address to each of the plurality of servers 
over a local network associated with the servers; and 

means for filtering the given client request in each of the 
plurality of servers so that the given client request is 
processed by only one of the servers. 

22. The apparatus of claim 21 wherein the filtering means 
is operative to apply a hash function to a client IP address 
associated with the given client request, and to compare the 
result of the applying step to an identifier of a particular 
server to determine wtiether that server should process the 
given client request. 

23. An apparatus for routing client requests for a network 
service over a communication network, the apparatus com- 
prising: 

a plurality of servers configured to support the network 
service, each of the servers having a primary address 
and a secondary address^ wherein a common address is 
assigned as the secondary address for each of the 
plurality of servers such that each of the servers indi- 
vidually has the secondary address; and 

a router coupled to the servers and operative to route 
client requests directed to the common address such 
that each of the requests is processed by a particular one 
of the plurality of servers without modification of the 
coQomon address in the request. 

24. The apparatus of claim 23 wherein the network 
utilizes a TCP/IP protocol and the primary and secondary 
addresses are primary and secondary IP addresses, respec- 
tively. 

25. The apparatus of claim 24 wherein the common 
address is an IP address which does not correspond to a 
primary IP address of any of the plurality of servers. 

26. The apparatus of claim 23 wherein at least one of the 
pltirality of servers is a UNIX-based server including mul- 
tiple network interface cards, and the common address is 
assigned for the at least one server using an ifconfig com- 
mand. 

27. The apparatus of claim 23 wherein the plurality of 
servers are UNIX-based servers, and the common address is 
assigned as the secondary address of the phirality of servers 
by utilizing an ifconfig alias cation for at least a subset of the 
plurality of servers. 



28. The apparatus of claim 23 wherein the router is further 
operative to route cUent requests such that a request of a 
given chent is routed to one of the plurality of servers based 
on application of a hash function to an IP address of the 

5 given client. 

29. The apparatus of claim 28 wherein the hash function 
is determined based on an analysis of a distribution of client 
IP addresses in an access log associated with one or more of 
the servers. 

10 30. The apparatus of claim 28 wherein the bash function 
is reapplied to the client IP address to identify another server 
if a server identified as a result of a previous application of 
the hash function has failed. 

31. Ibe apparatus of claim 23 further including a dis- 
15 patcher coupled to the router and to a local netwoik asso- 
ciated with the plurality of servers, ^ch that the router 
directs client requests having the common address to the 
dispatcher, and the dispatcher selects a particular one of the 
servers to process a given client request based on application 

20 of a hash function to a corresponding client address. 

32. The apparatus of claim 23 wherein the router is further 
operative to broadcast a given client request directed to the 
common address to each of the plurality of servers over a 
local network associated with the servers, and further 

25 wherein each of the servers implements a filtering routine so 
that the given client request is processed by only one of the 
servers. 

33. The apparatus of claim 32 wherein the filtering routine 
involves applying a hash function to a dient IP address 
associated with the given client request, and comparing the 
result to an identifier of a particular server to determine 
whether that server should process the given client request. 

34. A method of routing dient requests to a plurality of 
servers configured; to support a netwoik service over a 
communication network, each of the servers having a pri- 
mary address, the metlKxl comprising the steps of: 

assigning a common address as a secondary address for 
each of the plurality of servers such that each of the 
servers individually has the secondary address; and 
processing client requests directed to the common address 
such that each of the requests is processed by a par- 
ticular one of the plurality of servers without modifi- 
cation of the common address in the request, wherein 
each of the servers provides access to substantially the 
same set of contents associated with the network ser- 
vice. 

35. An apparatus for routing client requests to a plurality 
of servers configured to support a network service over a 
communication network, each of the servers having a pri- 
mary address, the apparatus comprising: 

means for agsigning a common address as a secondary 
address for eadi of the plurality of servers such that 
each of the servers individually has the secondary 
address; and 

means for processing client requests directed to the com- 
mon address such that eadi of the requests is processed 

by a particular_one_of the pluraUty^of servers without 

modification of the common address in the request, 
50 wherein each of the servers provides access to substan- 
tially the same set of contents associated with the 
network service. 

36. An apparatus for routing client requests for a network 
service over a oommutucaUon network, the apparatus com- 

ss prising: 

a plurality of servers configured to support the network 
service, each of the servCTS having a primary address 
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aod a secondary address, wherein a common address is 
assigned as the secondary address for each of the 
plurality of servers sudi that each of the servers indi- 
vidually has the secondary address, and wherein each 
of the servers provides access to substantially the same 
set of contents associated with the network service; and 
a router coupled to the servers and operative to route 
client requests directed to the common address such 
that each of the requests is processed by a particular one 
of the plurality of servers without modification of the 
common address in the request. 

37. A method of routing client requests to a plurality of 
servers configured to siipport a network service over a 
commimication network, each of the servers having a pri- 
mary address, the method comprising the steps of: 

assigning a common address as a secondary address for 
each of the plurality of servers such that each of the 
servers individually has the secondary address; 

broadcasting a given client request directed to the com- 
mon address to each of the plurality of servers; and 

implementing a filtering routirae in each of the plurality of 
servers so that the given client request is processed by 
only one of the servers without modification of the 
common address in the request. 

38. An apparatus for routing client reque^ to a plurality 
of servers configured to support a network service over a 
communication network, each of the servers having a pri- 
mary address, the apparatus comprising: 

means for assigning a common address as a secondary 
address for each of the plurality of servers such that 
each of the servers individually has the secondary 
address; 
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means for broadcasting a given client request directed to 
the common address to each of the plurality of servers; 
and 

means for filtering the given chent request in each of the 
plurahty of servers so that the given client request is 
processed by only one of the servers without modifi- 
cation of the common address in the request. 
39. An a^^aratus for routing client requests for a network 
service over a communication network, the apparatus com- 
prising: 

a plurality of servers configured to support the network 
service, each of the servers having a primary address 
and a secondary address, wherein a common address is 
assigned as the secondary address for each of the 
plurality of servers such that each of the servers indi- 
vidually has the secondary address; and 

a router coupled to the servers and operative to route 
client requests directed to the oonmion address such 
that each of the requests is processed by a particular one 
of the plurality of servers, wherein the router is further 
operative to broadcast a given client request directed to 
the common address to each of the plurality of servers, 
and further wherein each of the servers implements a 
filtering routine so that the given client request is 
processed by only one of the servers without modifi- 
cation of the common address in the request 
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